Information Extraction from Helicopter Maintenance Records as a Springboard for the Future of Maintenance Text Analysis
نویسندگان
چکیده
This paper introduces a novel application of information extraction techniques to extract data from helicopter maintenance records to populate a database. The goals of the research are to preprocess the text-based data for further use in data mining efforts and to develop a system to provide a rough analysis of generic maintenance records to facilitate in the development of training corpora for use in machine-learning for more refined information extraction system design. The Natural Language Toolkit was used to implement partial parsing of text by way of hierarchical chunking of the text. The system was targeted towards inspection descriptions and succeeded in extracting the inspection code, description of the part/action, and date/time information with 80.7% recall and 89.9% precision.
منابع مشابه
Prediction of The Pavement Condition For Urban Roadway A Tehran Case Study (RESEARCH NOTE)
This report is the result of a research project on a pavement management system that was preformed by the Transportation Division of Iran University of Science and Technology. Information used in the project was collected from 20 zones of the Tehran Municipality. Any maintenance and repair system for roads is normally compared of a number of general and coordinated activities in conjunction wit...
متن کاملEvaluation of the Effects of Maintenance and Rehabilitation Projects on Road User Costs via HDM-4 Software
Rapid growth in a number of vehicles on roadways accelerates pavement deterioration trends. Pavement inefficiency in carrying the applied load from passing vehicles results in spending significant costs on continues Maintenance and Rehabilitation (M&R) treatments. Lane closure owing to the implementation of M&R operations incurs enormous costs on road users. The research aimed to calculate, and...
متن کاملPresenting a method for extracting structured domain-dependent information from Farsi Web pages
Extracting structured information about entities from web texts is an important task in web mining, natural language processing, and information extraction. Information extraction is useful in many applications including search engines, question-answering systems, recommender systems, machine translation, etc. An information extraction system aims to identify the entities from the text and extr...
متن کاملLifelong , self-directed learning and the maintenance of competence: the triple helix of continuing professional development
Abstract It has been proposed that we think of continuing medical education (CME) as a two-stranded helix, in which one strand represents the internal characteristics of the learner-physician, the other strand the culture and environment in which he or she practices and lives. In many countries, the product of these two strands has been increasingly termed ‘continuing professional development’...
متن کاملData Extraction using Content-Based Handles
In this paper, we present an approach and a visual tool, called HWrap (Handle Based Wrapper), for creating web wrappers to extract data records from web pages. In our approach, we mainly rely on the visible page content to identify data regions on a web page. In our extraction algorithm, we inspired by the way a human user scans the page content for specific data. In particular, we use text fea...
متن کامل